Bounding information leakage in machine learning
نویسندگان
چکیده
Recently, it has been shown that Machine Learning models can leak sensitive information about their training data. This leakage is exposed through membership and attribute inference attacks. Although many attack strategies have proposed, little effort made to formalize these problems. We present a novel formalism, generalizing setups previously studied in the literature connecting them memorization generalization. First, we derive universal bound on success rate of attacks connect generalization gap target model. Second, study question how much stored by algorithm its set bounds mutual between attributes model parameters. Experimentally, illustrate potential our approach applying both synthetic data classification tasks natural images. Finally, apply formalism different strategies, with which an adversary able recover identity writers PenDigits dataset.
منابع مشابه
Optimizing Leakage Power using Machine Learning
As transistor technology nodes continue to scale into deep sub-micron processes, leakage power is becoming an increasingly large portion of the total power. This has been true for many years now, ever since deep submicron processes became available. In addition, more recently, computing is becoming increasingly mobile, where minimal power is of paramount importance. As a result, companies are b...
متن کاملInformation Theory and Machine Learning
Machine learning techniques are becoming increasingly useful primarily with the rapid development of Internet. A variety of machine learning methods have drawn inspirations or borrowed ideas from information theory. In this paper, we present a survey of such interactions between machine learning and information theory. Four important areas of machine learning are examined from the perspective o...
متن کاملMachine Learning for Information Extraction
As an increasing amount of information becomes available in the form of electronic documents, the need to intelligently process such texts makes shallow text understanding methods such as Information Extraction (IE) particularly useful. IE has been restrictedly defined by DARPA's MUC program [MUC Proceedings] as the task of extracting specific, well-defined types of information from text in res...
متن کاملMachine Learning for Information Retrieval
In this thesis, we explore the use of machine learning techniques for information retrieval. More specifically, we focus on ad-hoc retrieval, which is concerned with searching large corpora to identify the documents relevant to user queries. This identification is performed through a ranking task. Given a user query, an ad-hoc retrieval system ranks the corpus documents, so that the documents r...
متن کاملInformation Visualization using Machine Learning
Data visualization is an important tool for discovering patterns in the data. Finding interesting visualizations can be however a difficult task if there are many possible ways to visualize the data. In this paper we present the VizRank method that can estimate visualization interestingness. The method can be applied on a number of visualization techniques and can automatically identify the mos...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neurocomputing
سال: 2023
ISSN: ['0925-2312', '1872-8286']
DOI: https://doi.org/10.1016/j.neucom.2023.02.058